Overview

Dataset Statistics

Number of Variables 12
Number of Rows 3584
Missing Cells 39
Missing Cells (%) 0.1%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 956.1 KB
Average Row Size in Memory 273.2 B
Variable Types
  • Categorical: 5
  • Numerical: 7

Dataset Insights

ID is uniformly distributed Uniform
Angle is skewed Skewed
Capacity is skewed Skewed
CapacityFactor is skewed Skewed
Temp(°Cd) is skewed Skewed
Date has a high cardinality: 507 distinct values High Cardinality
Set has constant value "train" Constant
Set has constant length 5 Constant Length
Date has constant length 10 Constant Length
Angle has 1131 (31.56%) negatives Negatives
Angle has 444 (12.39%) zeros Zeros
  • 1
  • 2

Variables


Set

categorical

Approximate Distinct Count 1
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 250880

Length

Mean 5
Standard Deviation 0
Median 5
Minimum 5
Maximum 5

Sample

1st row train
2nd row train
3rd row train
4th row train
5th row train

Letter

Count 17920
Lowercase Letter 17920
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • Set has words of constant length

ID

numerical

Approximate Distinct Count 3584
Approximate Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 57344
Mean 1792.5
Minimum 1
Maximum 3584
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • ID is uniformly distributed

Quantile Statistics

Minimum 1
5-th Percentile 180.15
Q1 896.75
Median 1792.5
Q3 2688.25
95-th Percentile 3404.85
Maximum 3584
Range 3583
IQR 1791.5

Descriptive Statistics

Mean 1792.5
Standard Deviation 1034.756
Variance 1.0707e+06
Sum 6.4243e+06
Skewness 0
Kurtosis -1.2
Coefficient of Variation 0.5773
  • ID is not normally distributed (p-value 9.122646544633201e-08)

Date

categorical

Approximate Distinct Count 507
Approximate Unique (%) 14.1%
Missing 0
Missing (%) 0.0%
Memory Size 268800

Length

Mean 10
Standard Deviation 0
Median 10
Minimum 10
Maximum 10

Sample

1st row 2020-06-09
2nd row 2020-06-10
3rd row 2020-06-11
4th row 2020-06-12
5th row 2020-06-13

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 7168
Decimal Number 28672
  • Date has words of constant length

Lat

categorical

Approximate Distinct Count 9
Approximate Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Memory Size 251668

Length

Mean 5.2199
Standard Deviation 0.4142
Median 5
Minimum 5
Maximum 6

Sample

1st row 25.11
2nd row 25.11
3rd row 25.11
4th row 25.11
5th row 25.11

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 15124

Lon

categorical

Approximate Distinct Count 8
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Memory Size 254072

Length

Mean 5.8906
Standard Deviation 0.3122
Median 6
Minimum 5
Maximum 6

Sample

1st row 121.26
2nd row 121.26
3rd row 121.26
4th row 121.26
5th row 121.26

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 17528

Angle

numerical

Approximate Distinct Count 11
Approximate Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 57344
Mean -20.5756
Minimum -160
Maximum 22
Zeros 444
Zeros (%) 12.4%
Negatives 1131
Negatives (%) 31.6%
  • Angle is skewed left (γ1 = -1.7332)

Quantile Statistics

Minimum -160
5-th Percentile -160
Q1 -31
Median 1.76
Q3 4.63
95-th Percentile 22
Maximum 22
Range 182
IQR 35.63

Descriptive Statistics

Mean -20.5756
Standard Deviation 53.0587
Variance 2815.2287
Sum -73742.82
Skewness -1.7332
Kurtosis 1.6713
Coefficient of Variation -2.5787
  • Angle is not normally distributed (p-value 4.835734560090639e-10)
  • Angle has 635 outliers

Module

categorical

Approximate Distinct Count 4
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 287230
  • The largest value (AUO PM060MW3 320W) is over 1.84 times larger than the second largest value (MM60-6RT-300)

Length

Mean 15.1423
Standard Deviation 2.3043
Median 17
Minimum 12
Maximum 17

Sample

1st row MM60-6RT-300
2nd row MM60-6RT-300
3rd row MM60-6RT-300
4th row MM60-6RT-300
5th row MM60-6RT-300

Letter

Count 23156
Lowercase Letter 0
Space Separator 4252
Uppercase Letter 23156
Dash Punctuation 3232
Decimal Number 23630
  • The top 2 categories (AUO PM060MW3 320W, MM60-6RT-300) take over 50.0%

Capacity

numerical

Approximate Distinct Count 14
Approximate Unique (%) 0.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 57344
Mean 350.535
Minimum 99.2
Maximum 499.8
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Capacity is skewed left (γ1 = -0.4496)

Quantile Statistics

Minimum 99.2
5-th Percentile 99.2
Q1 246.4
Median 352
Q3 498.56
95-th Percentile 499.8
Maximum 499.8
Range 400.6
IQR 252.16

Descriptive Statistics

Mean 350.535
Standard Deviation 144.4989
Variance 20879.9299
Sum 1.2563e+06
Skewness -0.4496
Kurtosis -1.1286
Coefficient of Variation 0.4122
  • Capacity is not normally distributed (p-value 3.440561087014017e-18)

CapacityFactor

numerical

Approximate Distinct Count 3074
Approximate Unique (%) 85.8%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 57344
Mean 3.8876
Minimum 0.07757
Maximum 21.4431
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • CapacityFactor is skewed left (γ1 = -0.2824)

Quantile Statistics

Minimum 0.07757
5-th Percentile 0.88
Q1 2.9925
Median 4.2938
Q3 5.0392
95-th Percentile 5.7301
Maximum 21.4431
Range 21.3655
IQR 2.0467

Descriptive Statistics

Mean 3.8876
Standard Deviation 1.5382
Variance 2.3661
Sum 13933.2988
Skewness -0.2824
Kurtosis 4.1432
Coefficient of Variation 0.3957
  • CapacityFactor is not normally distributed (p-value 1.2950359304479467e-07)
  • CapacityFactor has 1 outliers

Generation(kWd)

numerical

Approximate Distinct Count 1990
Approximate Unique (%) 55.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 57344
Mean 1339.4838
Minimum 17
Maximum 6752
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Generation(kWd) is skewed right (γ1 = 0.3935)

Quantile Statistics

Minimum 17
5-th Percentile 259.15
Q1 575
Median 1268
Q3 1957
95-th Percentile 2708
Maximum 6752
Range 6735
IQR 1382

Descriptive Statistics

Mean 1339.4838
Standard Deviation 796.6985
Variance 634728.5468
Sum 4.8007e+06
Skewness 0.3935
Kurtosis -0.441
Coefficient of Variation 0.5948
  • Generation(kWd) has 1 outliers

Irradiance(kWd/m2)

numerical

Approximate Distinct Count 961
Approximate Unique (%) 27.0%
Missing 24
Missing (%) 0.7%
Infinite 0
Infinite (%) 0.0%
Memory Size 56960
Mean 4.7966
Minimum 0.03611
Maximum 8.0056
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Irradiance(kWd/m2) is skewed left (γ1 = -0.6519)

Quantile Statistics

Minimum 0.03611
5-th Percentile 1.0719
Q1 3.6583
Median 5.2139
Q3 6.2479
95-th Percentile 7.3056
Maximum 8.0056
Range 7.9694
IQR 2.5896

Descriptive Statistics

Mean 4.7966
Standard Deviation 1.8943
Variance 3.5885
Sum 17075.9
Skewness -0.6519
Kurtosis -0.4833
Coefficient of Variation 0.3949

Temp(°Cd)

numerical

Approximate Distinct Count 221
Approximate Unique (%) 6.2%
Missing 15
Missing (%) 0.4%
Infinite 0
Infinite (%) 0.0%
Memory Size 57104
Mean 25.7228
Minimum 6.9
Maximum 32.5
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Temp(°Cd) is skewed left (γ1 = -0.97)

Quantile Statistics

Minimum 6.9
5-th Percentile 15.5
Q1 22
Median 28.2
Q3 30
95-th Percentile 30.9
Maximum 32.5
Range 25.6
IQR 8

Descriptive Statistics

Mean 25.7228
Standard Deviation 5.3412
Variance 28.5285
Sum 91804.7
Skewness -0.97
Kurtosis -0.1549
Coefficient of Variation 0.2076
  • Temp(°Cd) is not normally distributed (p-value 5.868983748359949e-10)
  • Temp(°Cd) has 19 outliers

Interactions

Correlations

Missing Values